python图片小爬虫

来源:互联网 发布:战地2小队数据 编辑:程序博客网 时间:2024/04/26 07:07
import reimport urllibimport osdef rename(name):     name = name + '.jpg'    return name  def getHtml(url):    page = urllib.urlopen(url)    html = page.read()    return htmldef getImg(html):    reg = r'src="(.+?\.jpg)" pic_ext'    imgre = re.compile(reg)    imglist = re.findall(imgre,html)            os.chdir("E:\\pic")      os.getcwd()     x=1    for imgurl in imglist:        img=urllib.urlopen(imgurl)                          name=str(x)          name = rename(name)          print(name)         x=x+1                f=open(name,'wb')        f.write(img.read())         f.close()       html = getHtml("http://tieba.baidu.com/p/3553148164")getImg(html)print 'pic save!'    



爬取的网页是  http://tieba.baidu.com/p/3553148164

图片保存在E盘pic文件夹下


爬取结果如下:


1 0