Oracle 中使用fetch bulk collect into 批量效率的读取游标数据

来源：互联网发布：大麦盒子电视直播软件编辑：程序博客网时间：2024/04/25 21:57

fetch bulk collect into 的使用格式是：fetch some_cursor collect into col1, col2 limit xxx。col1、col2 是声明的集合类型变量，xxx 为每次取数据块的大小（记录数），相当于缓冲区的大小，可以不指定 limit xxx 大小。下面以实际的例子来说明它的使用，并与逐条取记录的 fetch into 执行效率上进行比较。
使用 fetch bulk collect into 获取游标数据

declare  
    
  --声明需要集合类型及变量，参照字段的 type 来声明类型   
  type id_type is table of sr_contacts.sr_contact_id%type;   
  v_id id_type;   
       
  type phone_type is table of sr_contacts.contact_phone%type;   
  v_phone phone_type;   
       
  type remark_type is table of sr_contacts.remark%type;   
  v_remark remark_type;   
    
  cursor all_contacts_cur is --用 rownum 来限定取出的记录数来测试   
     select sr_contact_id,contact_phone,remark from sr_contacts where rownum <= 100000;   
    
begin  
         
    open all_contacts_cur;   
    loop   
        fetch all_contacts_cur bulk collect into v_id,v_phone,v_remark limit 256;   
        for i in 1..v_id.count loop --遍历集合   
            --用 v_id(i)/v_phone(i)/v_remark(i) 取出字段值来执行你的业务逻辑   
            null; --这里只放置一个空操作，只为测试循环取数的效率  
        end loop;   
        exit when all_contacts_cur%notfound; --exit 不能紧接 fetch 了，不然会漏记录   
    end loop;   
    close all_contacts_cur;   
end;

使用 fetch into 逐行获取游标数据

declare
  
  --声明变量，参照字段的 type 来声明类型
  v_id sr_contacts.sr_contact_id%type;
  v_phone sr_contacts.contact_phone%type;
  v_remark sr_contacts.remark%type;
  
  cursor all_contacts_cur is  --用 rownum 来限定取出的记录数来测试
     select sr_contact_id,contact_phone,remark from sr_contacts where rownum <= 100000;
  
begin
      
    open all_contacts_cur;
    loop
        fetch all_contacts_cur into v_id,v_phone,v_remark;
        exit when all_contacts_cur%notfound;    
        --用 v_id/v_phone/v_remark 取出字段值来执行你的业务逻辑
        null; --这里只放置一个空操作，只为测试循环取数的效率
    end loop;
    close all_contacts_cur;
end;

执行性能比较

看看测试的结果，分别执行五次所耗费的秒数：

当 rownum <= 100000 时：
fetch bulk collect into 耗时：0.125秒, 0.125秒, 0.125秒, 0.125秒, 0.141秒
fetch into 耗时：                 1.266秒, 1.250秒, 1.250秒, 1.250秒, 1.250秒

当 rownum <= 1000000 时：
fetch bulk collect into 耗时：1.157秒, 1.157秒, 1.156秒, 1.156秒, 1.171秒
fetch into 耗时：              12.128秒, 12.125秒, 12.125秒, 12.109秒, 12.141秒

当 rownum <= 10000 时：
fetch bulk collect into 耗时：0.031秒, 0.031秒, 0.016秒, 0.015秒, 0.015秒
fetch into 耗时：                 0.141秒, 0.140秒, 0.125秒, 0.141秒, 0.125秒

当 rownum <= 1000 时：
fetch bulk collect into 耗时：0.016秒, 0.015秒, 0.016秒, 0.016秒, 0.015秒
fetch into 耗时：                 0.016秒, 0.031秒, 0.031秒, 0.032秒, 0.015秒

从测试结果来看游标的记录数越大时，用 fetch bulk collect into 的效率很明显示，趋于很小时就差不多了。
注意了没有，前面使用 fetch bulk collect into 时前为每一个查询列都定义了一个集合，这样有些繁琐。我们之前也许用过表的 %rowtype 类型，同样的我们也可以定义表的 %rowtype 的集合类型。看下面的例子，同时在这个例子中，我们借助于集合的 first、last 属性来代替使用 count 属性来进行遍历。

declare
  
  --声明需要集合类型及变量，参照字段的 type 来声明类型
  type contacts_type is table of sr_contacts%rowtype;
  v_contacts contacts_type;
    
  cursor all_contacts_cur is --用 rownum 来限定取出的记录数来测试
     select * from sr_contacts where rownum <= 10000;
  
begin
      
    open all_contacts_cur;
    loop
        fetch all_contacts_cur bulk collect into v_contacts limit 256;
        for i in v_contacts.first .. v_contacts.last loop --遍历集合
            --用 v_contacts(i).sr_contact_id/v_contacts(i).contact_phone/v_contacts(i).remark
            --的形式来取出各字段值来执行你的业务逻辑
            null; --这里只放置一个空操作，只为测试循环取数的效率
        end loop;
        exit when all_contacts_cur%notfound;
    end loop;
    close all_contacts_cur;
end;

关于 limit 参数

你可以根据你的实际来调整 limit 参数的大小，来达到你最优性能。limit 参数会影响到 pga 的使用率。而且也可以在 fetch bulk 中省略 limit 参数，写成

fetch all_contacts_cur bulk collect into v_contacts;

有些资料中是说，如果不写 limit 参数，将会以数据库的 arraysize 参数值作为默认值。在 sqlplus 中用 show arraysize 可以看到该值默认为 15，set arraysize 256 可以更改该值。而实际上我测试不带 limit 参数时，外层循环只执行了一轮，好像不是 limit 15，所以不写 limit 参数时，可以去除外层循环，begin-end 部分可写成：

begin
    open all_contacts_cur;
    fetch all_contacts_cur bulk collect into v_contacts;
    for i in v_contacts.first .. v_contacts.last loop --遍历集合
        --用 v_contacts(i).sr_contact_id/v_contacts(i).contact_phone/v_contacts(i).remark
        --的形式来取出各字段值来执行你的业务逻辑
        null; --这里只放置一个空操作，只为测试循环取数的效率
        dbms_output.put_line(2000);
    end loop;
    close all_contacts_cur;
end;

bulk collect 的其他用法(总是针对集合)

select into 语句中，如：

SELECT sr_contact_id,contact_phone BULK COLLECT INTO v_id,v_phone
     FROM sr_contacts WHERE ROWNUM <= 100;
dbms_output.put_line('Count:'||v_id.count||', First:'||v_id(1)||'|'||v_phone(1));

returning into 语句中，如：

DELETE FROM sr_contacts WHERE sr_contact_id < 30
    RETURNING sr_contact_id, contact_phone BULK COLLECT INTO v_id, v_phone;
dbms_output.put_line('Count:'||v_id.count||', First:'||v_id(1)||'|'||v_phone(1));

forall 的 bulk dml 操作，它大大优于 for 集合后的操作

fetch all_contacts_cur bulk collect into v_contacts;
forall i in 1 .. v_contacts.count
--forall i in v_contacts.first .. v_contacts.last
--forall i in indices of v_contacts --10g以上,可以是非连续的集合
insert into sr_contacts(sr_contact_id,contact_phone,remark)
    values(v_contacts(i).sr_contact_id,v_contacts(i).contact_phone,v_contacts(i).remark);
    --或者是单条的 delete/update 操作

0 0