Thank you John Benjamins, Joe Fletcher, Doug Hughes, Ayaz Anjum, Julio Carrasco and others I might has missed out for the responses. Everyone has been very helpful! :) It was a patch thing in the end (I think). I decided to power off the server and try doing a reconfigure boot again (in diagnostics mode in fact, to find the fans responding ok). Sun initially thought it's a failed CPU fan when the two patches didn't work. But looking at the log during booting, there were lots of picld errors from the start, things like "no such file or directory", "error running psvc_fan_fault_check_policy_0" on lots of thing..makes it sound like a dodgy boot in the first place. Sun recommended that sometimes powering off the server will help, before doing a boot -r. Now that I think about it, I have shutdown to single user mode, did the patch, before doing a reconfigure boot. Many thanks once again! Kath PS: Most responses I have applied to Sol8 rather than 9, but I attach them here anyways. Original Q: > > Dear managers, > > Some updates, it seems that John Benjamins had a similar problem in Solaris > 8...I don't see any similar patches for v880s with Solaris 9 though, other > than 113447-17 and 113573 that Sun pointed. With these two patches I can see > my memory information now, but prtdiag -v shows the following unusual > environmental status (similar to that mentioned in > http://sunportal.sunmanagers.org/pipermail/summaries/2003-June/004000.html): > as well as the same console errors by picld as mentioned before. > > > Fan Bank : > ---------- > > Bank Speed Status Fan State > ( RPMS ) > ---- -------- --------- --------- > CPU0_PRIM_FAN failed in picl_get_propval_by_name for fan speed > General system failure > > I guess I'll wait for more updated patches from Sun now for Solaris 9. > Thanks Joe Fletcher for pointing out for me to look for picld patches. Very > flaky indeed. > > Oh, 113573-05 recommends installing patches 113574-07 in the (I assume) > latest README, but the latter patch has been withdrawn. Oh well. > > > Original Q: > > > > > Dear managers, need your prompt help! > > > > I've been getting these errors in /var/adm/messages constantly since a > > reboot a machine, a Sunfire v880 running Solaris 9 Generic_112233-12 (due > to > > a power failure by the way) -- > > > > .... > > Jun 11 03:12:12 serv picld[93]: [ID 710302 daemon.error] I/O error > > Jun 11 03:12:13 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on CPU1_PRIM_FAN (249 > > 9992) > > Jun 11 03:12:13 serv picld[93]: [ID 710302 daemon.error] I/O error > > Jun 11 03:12:15 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on IO_BRIDGE_PRIM_FAN > > (2500216) > > Jun 11 03:12:15 serv picld[93]: [ID 710302 daemon.error] I/O error > > Jun 11 03:12:48 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on CPU0_PRIM_FAN (249 > > 9960) > > Jun 11 03:12:48 serv picld[93]: [ID 710302 daemon.error] I/O error > > Jun 11 03:12:49 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on CPU1_PRIM_FAN (249 > > 9992) > > Jun 11 03:12:49 serv picld[93]: [ID 710302 daemon.error] I/O error > > Jun 11 03:12:51 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on IO_BRIDGE_PRIM_FAN > > (2500216) > > .... > > > > > > In the logs during the reboot, the &quot;PS2 Device unplugged&quot; is the > last error > > picld gives...could this be a cause of the problem? -- > > .... > > May 30 20:37:57 serv eri: [ID 517527 kern.info] SUNW,eri0 : 100 Mbps full > > duplex link up > > May 30 20:38:00 serv last message repeated 1 time > > May 30 20:38:02 serv pseudo: [ID 129642 kern.info] pseudo-device: devinfo0 > > May 30 20:38:02 serv genunix: [ID 936769 kern.info] devinfo0 is > > /pseudo/devinfo@0 > > May 30 20:42:23 serv picld[93]: [ID 293134 daemon.error] Device PS2 > > unplugged > > May 30 20:42:50 serv fsck[164]: [ID 293258 user.error] libsldap: Status: 2 > > Mesg: Unable to load configuration '/var/ldap/ > > ldap_client_file' (''). > > May 30 20:42:50 serv last message repeated 3 times > > May 30 20:42:50 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on CPU0_PRIM_FAN (249 > > 9960) > > May 30 20:42:50 serv picld[93]: [ID 875627 daemon.error] No such file or > > directory > > May 30 20:42:51 serv fsck[164]: [ID 293258 user.error] libsldap: Status: 2 > > Mesg: Unable to load configuration '/var/ldap/ > > ldap_client_file' (''). > > May 30 20:42:51 serv last message repeated 5 times > > May 30 20:42:52 serv picld[93]: [ID 478985 daemon.error] ERROR running > > psvc_fan_fault_check_policy_0 on CPU1_PRIM_FAN (249 > > 9992) > > May 30 20:42:52 serv picld[93]: [ID 875627 daemon.error] No such file or > > directory > > May 30 20:42:53 serv fsck[164]: [ID 293258 user.error] libsldap: Status: 2 > > Mesg: Unable to load configuration '/var/ldap/ > > ldap_client_file' (''). > > May 30 20:42:53 serv last message repeated 2 times > > .... > > > > Running prtdiag shows the following, and the &quot;no memory&quot; part is > giving me a > > heart attack. Could this just be (from the logs above), an incomplete > boot? > > I am thinking of rebooting the machine and seeing if it will be the same, > or > > do you think it's something failing for sure? Many thanks in advance for > > reading. Will summarise. > > > > &gt;prtdiag -v > > System Configuration: Sun Microsystems sun4u Sun Fire 880 > > System clock frequency: 150 MHz > > Memory size: 8192 Megabytes > > > > ========================= CPUs > > =============================================== > > > > Run E$ CPU CPU > > Brd CPU MHz MB Impl. Mask > > --- --- ---- ---- ------- ---- > > A 0 750 8.0 US-III 5.4 > > B 1 750 8.0 US-III 5.4 > > A 2 750 8.0 US-III 5.4 > > B 3 750 8.0 US-III 5.4 > > > > ========================= Memory Configuration > > =============================== > > > > Logical Logical Logical > > MC Bank Bank Bank DIMM Interleave Interleaved > > Brd ID num size Status Size Factor with > > ---- --- ---- ------ ----------- ------ ---------- ----------- > > Cannot find any memory bank/segment info. > > > > ========================= IO Cards ========================= > > > > > > Bus Max > > IO Port Bus Freq Bus Dev, > > Brd Type ID Side Slot MHz Freq Func State Name > > Model > > ---- ---- ---- ---- ---- ---- ---- ---- ----- > > -------------------------------- ---------------------- > > I/O PCI 9 A 8 33 66 1,0 ok SUNW,m64B > > SUNW,370-4362 > > > > No failures found in System > > =========================== > > > > > > ========================= Environmental Status ========================= > > > > System Temperatures (Celsius): > > ------------------------------- > > Device Temperature Status > > --------------------------------------- > > CPU0 68 OK > > CPU1 73 OK > > CPU2 59 OK > > CPU3 61 OK > > MB 31 OK > > IOB 26 OK > > DBP0 28 OK > > > > ================================= > > > > Front Status Panel: > > ------------------- > > Keyswitch position: NORMAL > > > > System LED Status: > > GEN FAULT REMOVE > > [OFF] [OFF] > > > > DISK FAULT POWER FAULT > > [OFF] [OFF] > > > > LEFT THERMAL FAULT RIGHT THERMAL FAULT > > [OFF] [OFF] > > > > LEFT DOOR RIGHT DOOR > > [OFF] [OFF] > > > > ================================= > > > > Disk Status: > > Presence Fault LED Remove LED > > DISK 0: [PRESENT] [OFF] [OFF] > > DISK 1: [PRESENT] [OFF] [OFF] > > DISK 2: [PRESENT] [OFF] [OFF] > > DISK 3: [PRESENT] [OFF] [OFF] > > DISK 4: [PRESENT] [OFF] [OFF] > > DISK 5: [PRESENT] [OFF] [OFF] > > DISK 6: [ EMPTY] > > DISK 7: [ EMPTY] > > DISK 8: [ EMPTY] > > DISK 9: [ EMPTY] > > DISK 10: [ EMPTY] > > DISK 11: [ EMPTY] > > > > ================================= > > > > Fan Bank : > > ---------- > > > > Bank Speed Status Fan State > > ( RPMS ) > > ---- -------- --------- --------- > > CPU0_PRIM_FAN 1298089537 [ENABLED] OK > > CPU1_PRIM_FAN 1298089537 [ENABLED] OK > > CPU0_SEC_FAN 0 [DISABLED] OK > > CPU1_SEC_FAN 0 [DISABLED] OK > > IO0_PRIM_FAN 4000 [ENABLED] OK > > IO1_PRIM_FAN 3947 [ENABLED] OK > > IO0_SEC_FAN 0 [DISABLED] OK > > IO1_SEC_FAN 0 [DISABLED] OK > > IO_BRIDGE_PRIM_FANfailed in picl_get_propval_by_name for fan speed > > General system failure > > Power Supplies: > > --------------- > > > > Supply Status Fan Fail Temp Fail CS Fail 3.3V 5V 12V 48V > > ------ ------------ -------- --------- ------- ---- -- --- --- > > PS0 GOOD 9 4 3 5 > > PS1 GOOD 9 3 3 5 > > PS2 UNPLUGGED > > > > > > ========================= HW Revisions > > ======================================= > > > > System PROM revisions: > > ---------------------- > > OBP 4.5.6 2002/01/04 12:30 > > > > IO ASIC revisions: > > ------------------ > > Port > > Brd Model ID Status Version > > ---- --------------- ---- ------ ------- > > IB-1 unknown 8 ok 4 > > IB-1 unknown 9 ok 4 Responses: -- could be hardware, but most likely software/firmware 1) make sure you have the I/O board firmware patches installed. 111474-07 or 113312-02 2) make sure you have the picl patches installed 110849-15 (5.8) 113263-05 (5.8) 113447-17 (5.9) 108528-29 (5.8) 109873-25 (5.8) 110852-11 (5.8) 110845-03 (5.8) 110460-32 (5.8) (others may be needed for 5.9) -- I have come across a similar problem with V880, and cause of the problem was that one of the free internal SCS cable got stuck in the fan and was holding it from rotating. Just opening the side door and releasing the cable solved the problem. -- If you have applied the recommended and security patch bundles, make sure you also apply the platform specific patches. Probably something that patches libpiclfrutree or some of the other picl plugin libraries. <wanted to, but the particular patch was withdrawn) -- look this document of sunsolve web.... Bug Id: 4700972 Category: firmware Subcategory: obp State: integrated Synopsis: Varied errors on OpenBoot Stop-A (or other error) followed by boot Description: Customer encountered error messages with V880 SunOS 5.8, V880 Highly recommended and 110460-17, 110849-09, OBP 4.5.6. Jun 7 16:03:31 v4u-880f picld[71]: ERROR running psvc_fan_fault_check_policy_0 on CPU0_PRIM_FAN (2500736) Jun 7 16:03:31 v4u-880f picld[71]: I/O error Jun 7 16:03:32 v4u-880f picld[71]: ERROR running psvc_fan_fault_check_policy_0 on CPU1_PRIM_FAN (2500768) Jun 7 16:03:32 v4u-880f picld[71]: I/O error Jun 7 16:03:35 v4u-880f picld[71]: ERROR running psvc_fan_fault_check_policy_0 on IO_BRIDGE_PRIM_FAN (2500992) Jun 7 16:03:35 v4u-880f picld[71]: I/O error To reproduce: 1) Power ON 2) Stop-A immidiately 3) boot at ok prompt Work around: 1) Power ON 2) Stop-A immidiately 3) reset 4) init 0 5) boot Integrated in releases: 4.x.build_23 Duplicate of: Patch id: See also: Summary: -- ________________________________________________ Message sent using Dodo Internet Webmail Server _______________________________________________ sunmanagers mailing list sunmanagers@sunmanagers.org http://www.sunmanagers.org/mailman/listinfo/sunmanagersReceived on Wed Jun 16 20:17:12 2004
This archive was generated by hypermail 2.1.8 : Thu Mar 03 2016 - 06:43:31 EST