ORACLE11g 云上Data Guard环境备库down机恢复实战过程
来源:互联网 发布:ipad网络非常慢怎么办 编辑:程序博客网 时间:2024/05/21 22:15
1、Oracle备库down机
微软云上的oracle备库意外ping不通了,各种报警上来了,联系微软云后台工程师,他们看了下,大概是说文件损坏了,信息如下:
Checking all file systems.
[/sbin/fsck.ext4 (1) -- /] fsck.ext4 -a /dev/sda1
/dev/sda1: clean, 134941/1905008 files, 2416626/7608064 blocks
[/sbin/fsck.ext4 (1) -- /oracle] fsck.ext4 -a /dev/sdc1
fsck.ext4: No such file or directory while trying to open /dev/sdc1
/dev/sdc1:
The superblock could not be read or does not describe a correct ext2
filesystem. If the device is valid and it really contains an ext2
filesystem (and not swap or ufs or something else), then the superblock
is corrupt, and you might try running e2fsck with an alternate superblock:
e2fsck -b 8193 <device>
[FAILED]
*** An error occurred during the file system check.
*** Dropping you to a shell; the system will reboot
*** when you leave the shell.
Give root password for maintenance
(or type Control-D to continue):
系统在启动的过程中无法识别/dev/sdc1 的superblock,未能正常启动到login界面,sdc1的superblock损坏。
2、尝试界面恢复
给azure云的工程师建议,直接在管理界面关闭,然后再启动,试试看。不知道他们是否采纳了,过了3个小时候后,他们跟我说一句恢复了,oracle备库服务器已经启动起来了。
然后我远程登录进去,一看数据文件目录还在,就准备startup,不过startup报错,如下所示
SQL> startup;
ORACLE instance started.
Total System Global Area 1.1825E+10 bytes
Fixed Size 2217024 bytes
Variable Size 6106909632 bytes
Database Buffers 5704253440 bytes
Redo Buffers 11849728 bytes
Database mounted.
ORA-10458: standby database requires recovery
ORA-01196: file 1 is inconsistent due to a failed media recovery session
ORA-01110: data file 1: '/home/oradata/powerdes/system01.dbf'
SQL>
看到是书籍不一致,所以在启动的时候,尝试介质恢复失败,在system01.dbf这个数据文件上面恢复失败了。
3、启动归档传输
启动归档,必须在mount状态下执行:
SQL> alter database recover managed standby database using current logfile disconnect from session;
Database altered.
SQL>
4、实时观察日志应用
实时观察,查看主库备库归档传输完成:
# 主库查看:
SQL> select sequence#,applied from v$archived_log order by sequence#;
......
SEQUENCE# APPLIED
---------- ---------
16850 NO
16850 NO
16851 NO
16851 NO
16852 NO
16852 NO
16853 NO
16853 NO
16854 NO
16854 NO
16855 NO
SEQUENCE# APPLIED
---------- ---------
16855 NO
5600 rows selected.
SQL>
# 备库查看:
SQL> select sequence#,applied from v$archived_log order by sequence#;
......
---------- ---------
16844 NO
16845 NO
16846 NO
16847 NO
16848 NO
16849 NO
16850 NO
16851 NO
16852 NO
16853 NO
16854 NO
SEQUENCE# APPLIED
---------- ---------
16855 NO
5600 rows selected.
SQL>
但是归档传输完成了,只是表面从主库的归档日志已经完全拉到备库了,但是日志应用还没有结束,还在执行中,去alert后台日志就可以看到信息:
Mon Nov 21 19:35:31 2016
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16715_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16716_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16717_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16718_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16719_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16720_906253421.dbf
Mon Nov 21 19:35:41 2016
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16721_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16722_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16723_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16724_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16725_906253421.dbf
Mon Nov 21 19:35:58 2016
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16726_906253421.dbf
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16727_906253421.dbf
Mon Nov 21 19:36:09 2016
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16728_906253421.dbf
也可以查看dg状态,看到正在应用日志中
SQL> col value for a80
SQL> set linesize 2000
SQL> col name for a50
SQL> col value for a80
SQL> select name,value from v$dataguard_stats;
NAME VALUE
-------------------------------------------------- --------------------------------------------------------------------------------
transport lag +00 00:00:00
apply lag +02 01:23:55
apply finish time +00 00:06:31.000
estimated startup time 11
SQL>
等日志应用完成后,再查看后台alert log可以看到:
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16853_906253421.dbf
Mon Nov 21 19:50:21 2016
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16854_906253421.dbf
Mon Nov 21 19:50:31 2016
Media Recovery Log /oracle/app/oracle/flash_recovery_area/archivelog1_16855_906253421.dbf
Media Recovery Waiting for thread 1 sequence 16856 (in transit)
Recovery of Online Redo Log: Thread 1 Group 4 Seq 16856 Reading mem 0
Mem# 0: /home/oradata/powerdes/redo_dg_021.log
5、打开数据库
SQL> alter database open;
alter database open
*
ERROR at line 1:
ORA-10456: cannot open standby database; media recovery session may be in
progress
SQL> select name,open_mode from v$database;
NAME OPEN_MODE
--------- --------------------
POWERDES MOUNTED
SQL> alter database recover managed standby database cancel;
Database altered.
SQL> alter database open;
Database altered.
SQL> alter database recover managed standby database using current logfile disconnect;
Database altered.
SQL>
至此,oracle备库完全修复完成,主备数据库开始保持一致保持同步。
- ORACLE11g 云上Data Guard环境备库down机恢复实战过程
- ORACLE 11g 用Duplicate恢复Data Guard 备库详细过程
- Oracle11g Data Guard配置手册
- oracle11g ORA-01555 ON ACTIVE DATA GUARD
- Oracle11g Active Data Guard搭建、管理
- Oracle11g Active Data Guard搭建、管理
- Oracle Data Guard 环境配置
- DataGuard - Sun Cluster + Data Guard灾备环境演示步骤
- Oracle 11g Data Guard 物理备库快速配置指南(上)
- Data Guard 之表误操作恢复
- Windows环境下配置oracle 11gR2 Data Guard详细过程
- ORACLE11g 没有控制文件如何通过rman备份恢复数据的详细实战过程
- Oracle RAC + Data Guard 环境搭建
- Data Guard 环境修改主备库IP地址
- 如何搭建一个 Data Guard 环境
- Oracle RAC + Data Guard 环境搭建
- 如何搭建一个 Data Guard 环境
- Data Guard 环境修改主备库IP地址
- UML基础--设计模式基本概念
- Android 手机上传图片至javaweb服务器(servlet)
- RabbitMQ (二)工作队列
- 第二章 λ-矩阵与矩阵的 Jordan 标准形
- OpenCv2 学习笔记(6) Mat元素访问和效率比较
- ORACLE11g 云上Data Guard环境备库down机恢复实战过程
- nyoj 又见01背包
- 第十周项目2-二叉树遍历的递归算法
- 项目遇到的问题小记
- Threadlocal 源码分析与内存泄漏
- Redis 安装(一)
- 特征工程框架图
- like模糊查询的占位符使用
- CentOS6.5启停脚本例子