Solaris SPARC: LMHB&nb…
来源:互联网 发布:施耐德m258编程软件 编辑:程序博客网 时间:2024/06/06 03:30
Slorai RAC 2节点,突然2个节点数据库都Down掉后重启了
alertASM.log中
Fri Apr 24 07:59:58 2015
Time drift detected. Please check VKTM trace file for moredetails.
Sat Apr 25 07:09:29 2015
LMD0 (ospid: 28470) waits for event 'ges remote message' for 173secs.
LMS0 (ospid: 28472) waits for event 'gcs remote message' for 175secs.
Errors in file/u01/app/grid/diag/asm/+asm/+ASM1/trace/+ASM1_lmhb_28476.trc
ORA-29770: global enqueue process LMD0 (OSID 28470) is hung formore than 150 seconds
Incident details in:/u01/app/grid/diag/asm/+asm/+ASM1/incident/incdir_376097/+ASM1_lmhb_28476_i376097.trc
Errors in file/u01/app/grid/diag/asm/+asm/+ASM1/trace/+ASM1_lmhb_28476.trc
ORA-29770: global enqueue process LMS0 (OSID 28472) is hung formore than 150 seconds
Incident details in:/u01/app/grid/diag/asm/+asm/+ASM1/incident/incdir_376098/+ASM1_lmhb_28476_i376098.trc
Sat Apr 25 07:09:40 2015
Sweep [inc][376098]: completed
Sweep [inc][376097]: completed
Sweep [inc2][376098]: completed
Sweep [inc2][376097]: completed
Sat Apr 25 07:09:40 2015
ERROR: Some process(s) is not making progress.
LMHB (ospid: 28476) is terminating the instance.
Please check LMHB trace file for more details.
Please also check the CPU load, I/O load and other systemproperties for anomalous behavior
ERROR: Some process(s) is not making progress.
LMHB (ospid: 28476): terminating the instance due to error29770
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
License high water mark = 19
Instance terminated by LMHB, pid = 28476
USER (ospid: 27314): terminating the instance
Instance terminated by USER, pid = 27314
查看 TRC文件:
*** 2015-04-25 07:09:29.355
==============================
LMD0 (ospid: 28470) has not moved for 173 sec(1429916968.1429916795)
kjfmGCR_HBCheckAll: LMD0 (ospid: 28470) has status 6
==================================================
=== LMD0 (ospid: 28470) Heartbeat Report
==================================================
LMD0 (ospid: 28470) has no heartbeats for 173 sec. (threshold 150sec)
===[ Wait Chain ]===
Wait chain is empty.
==============================
Dumping PROCESS LMD0 (ospid: 28470) States
==============================
===[ System Load State ]===
===[ Latch State ]===
DB的alertlog:
Completed checkpoint up to RBA [0xb8f6.2.10], SCN:10260692317532
Sat Apr 25 06:36:18 2015
Archived Log entry 72173 added for thread 1 sequence 47349 ID0xee7d887 dest 1:
Sat Apr 25 06:38:36 2015
Incremental checkpoint up to RBA [0xb8f6.14a22f.0], current logtail at RBA [0xb8f6.163b8f.0]
Sat Apr 25 06:39:23 2015
Beginning log switch checkpoint up to RBA [0xb8f7.2.10], SCN:10260692457839
Thread 1 advanced to log sequence 47351 (LGWR switch)
Sat Apr 25 06:39:30 2015
Archived Log entry 72174 added for thread 1 sequence 47350 ID0xee7d887 dest 1:
Sat Apr 25 06:39:38 2015
Completed checkpoint up to RBA [0xb8f7.2.10], SCN:10260692457839
Sat Apr 25 06:44:32 2015
Beginning log switch checkpoint up to RBA [0xb8f8.2.10], SCN:10260692664815
Thread 1 advanced to log sequence 47352 (LGWR switch)
Sat Apr 25 06:44:40 2015
Archived Log entry 72175 added for thread 1 sequence 47351 ID0xee7d887 dest 1:
Sat Apr 25 06:44:50 2015
Completed checkpoint up to RBA [0xb8f8.2.10], SCN:10260692664815
Sat Apr 25 06:49:27 2015
opiodr aborting process unknown ospid (14579) as a result ofORA-609
Sat Apr 25 06:56:20 2015
opiodr aborting process unknown ospid (16234) as a result ofORA-609
Sat Apr 25 06:58:40 2015
Incremental checkpoint up to RBA [0xb8f8.1ab53b.0], current logtail at RBA [0xb8f8.1b02ca.0]
Sat Apr 25 07:06:08 2015
Beginning log switch checkpoint up to RBA [0xb8f9.2.10], SCN:10260693117305
Thread 1 advanced to log sequence 47353 (LGWR switch)
Sat Apr 25 07:06:14 2015
Archived Log entry 72176 added for thread 1 sequence 47352 ID0xee7d887 dest 1:
Sat Apr 25 07:09:40 2015
NOTE: ASMB terminating
Errors in file/u01/prod/db/diag/rdbms/prod/PROD1/trace/PROD1_asmb_9310.trc:
ORA-15064: communication failure with ASM instance
ORA-03113: end-of-file on communication channel
Process ID:
Session ID: 433 Serial number: 31
Errors in file/u01/prod/db/diag/rdbms/prod/PROD1/trace/PROD1_asmb_9310.trc:
ORA-15064: communication failure with ASM instance
ORA-03113: end-of-file on communication channel
Process ID:
Session ID: 433 Serial number: 31
ASMB (ospid: 9310): terminating the instance due to error15064
Sat Apr 25 07:09:40 2015
System state dump requested by (instance=1, osid=9310 (ASMB)),summary=[abnormal instance termination].
System State dumped to trace file/u01/prod/db/diag/rdbms/prod/PROD1/trace/PROD1_diag_9168.trc
Sat Apr 25 07:09:40 2015
opiodr aborting process unknown ospid (26330) as a result ofORA-1092
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
opiodr aborting process unknown ospid (17901) as a result ofORA-1092
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
opiodr aborting process unknown ospid (24588) as a result ofORA-1092
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
opiodr aborting process unknown ospid (19128) as a result ofORA-1092
Sat Apr 25 07:09:41 2015
opiodr aborting process unknown ospid (19144) as a result ofORA-1092
Sat Apr 25 07:09:41 2015
opiodr aborting process unknown ospid (17899) as a result ofORA-1092
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:41 2015
opiodr aborting process unknown ospid (29116) as a result ofORA-1092
Sat Apr 25 07:09:45 2015
ORA-1092 : opitsk aborting process
Sat Apr 25 07:09:45 2015
License high water mark = 1879
Instance terminated by ASMB, pid = 9310
USER (ospid: 27351): terminating the instance
Instance terminated by USER, pid = 27351
Sat Apr 25 07:11:58 2015
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
参考文档:
Applies to:
Oracle Database - Enterprise Edition - Version 10.2.0.4 to12.1.0.1 [Release 10.2 to 12.1]
Oracle Solaris on SPARC (64-bit)
Symptoms
Running Oracle GI/RAC 11gR2, 11.2.0.1 on a two node SPARC(64-bit) cluster.
LMBH terminated one of the instances.
The following errors were reported in the Alert log:
A review of the LMHB trace file shows that this issue isimpacting all the Lock Management BG Processes.
For each LM BG processes (LMON, LMD0, LMS0, LMS1 and LCK0 inthis case), the trace file shows information similar to thefollowing:
The same sort of output will be repeated for each BG process every20 seconds until we exceed the thresholds:
Note that in the trace output, the wait_idrelated to each of the BG processes is not changing throughout theLMHB trace file .
Hence in this example, all LMON 'waiting for event' reports inthe trace file reflect the same wait_id(3381334669 in this example)
Cause
A review of the Alert log shows that the previous instancestartup took place on October 12 2011.
The October 12th restart was also due to a similar instancetermination by the LHMB process.
Prior to October 12th, the previous instance startup wasFebruary 7th 2011.
Calculating the number of days between crashes we see that theytook place around once every 248 days.
The reported symptoms match Solaris SPARC specific bug 10194190.Please refer to the following article for additional information onthis issue:
Solution
The bug is fixed in 11.2.0.4 and 12.1.0.2
For other releases, check the link to Interim patchesin Doc ID 10194190.8 and if available, apply the patch for bug10194190.
Alternatively, schedule instance restarts to occur before 248days of instance uptime.
- Solaris SPARC: LMHB&nb…
- 更新 Oracle Solaris&nb…
- Lesson 39 Am I&nb…
- What do I need&nb…
- What do I need&nb…
- ACM: LA 3266 -&nb…
- Troubleshooting ORA-1555&nb…
- 【转】Attachment support&nb…
- 【原】Android DHCP&nb…
- 【原】Android DHCP&nb…
- AccessWebElements(jsp by&nb…
- [js]Uncaught RangeError:&nb…
- 【转载】Spring RMI&nb…
- 【原创】 MySQLdb.cursors&nb…
- csapeditorctrl getobject&nb…
- StringUtils中 isNotEmpty&nb…
- ORA-00845: MEMORY_TARGET&nb…
- ORA-1652: Unable To&nb…
- android studio安装app异常-DELETE_FAILED_INTERNAL_ERROR
- sqlserver 将查询的结果创建为新表
- ubuntu sudoer权限管理
- bad superblock on /dev/mapper/*
- 扩展欧几里得求逆元:
- Solaris SPARC: LMHB&nb…
- ORACLE RAC环境下读取序列乱序问题
- How to Move Table…
- 链接服务器使用OPENQUERY性能提升
- High "Library Cache&nb…
- Table '%s' i…
- ORA-1578 / ORA-26040&n…
- 100
- Solaris 查找一个目录下软链…