ruby+nokogori抓取糗事百科前10頁並存儲進資料庫示例

来源:互联网 发布:网络推广部门口号 编辑:程序博客网 时间:2024/05/29 13:37

ruby+nokogori抓取糗事百科前10頁並存儲進資料庫示例





#encoding:utf-8require "open-uri"require "nokogiri"require "mysql"@dbh=Mysql.real_connect("localhost","root","dengli","pachong")@file=open("qb.txt","w")def getQiubai(url)data=open(url){|f| f.read}doc=Nokogiri::HTML(data)doc.css('div.block.untagged.mb15.bs2').each{|p|content=p.css('div.content').text@file.puts("#{content}\n")img=p.css('div.thumb img').each{|img|imgAddr=img.attr('src') @file.puts("#{imgAddr}\n")time=Time.nowsql="INSERT INTO qiushibaike (`body`,`img`,`time`) VALUES ('#{content}','#{imgAddr}','#{time}')"@dbh.query(sql)}}endfor i in 1..10@file.puts("這是第#{i}頁")url="http://www.qiushibaike.com/8hr/page/#{i}"getQiubai(url)end
0 0