Python Crawler Basics 1: Saving Images from a Web Page

Source: Internet | Published by: 综合办公软件下载 | Editor: 程序博客网 | Date: 2024/05/16 15:50

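# coding=utf-8

import re
import urllib.request

def getHtml(url):
    # Download the page and return its contents as text.
    # Assumes UTF-8; bytes that cannot be decoded are dropped.
    page = urllib.request.urlopen(url)
    html = page.read().decode('utf-8', 'ignore')
    return html

def getImg(html):
    # Capture the value of every src="..." attribute ending in .jpg.
    reg = r'src="(.+?\.jpg)"'
    imgre = re.compile(reg)
    imglist = re.findall(imgre, html)
    x = 0
    for imgurl in imglist:
        # Save each image in the current directory as 0.jpg, 1.jpg, ...
        urllib.request.urlretrieve(imgurl, '%s.jpg' % x)
        x += 1
    return imglist

html = getHtml("http://www.cocoachina.com/bbs/read.php?tid=182334&page=1")

print(getImg(html))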
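
The regular-expression step in getImg can be tried offline, without downloading anything. The following is a minimal sketch, assuming Python 3. The helper name extract_jpg_urls, the sample HTML, and the example.com base URL are hypothetical and do not come from the original post. The pattern is a slightly tightened variant of the one in getImg (it refuses to match across a closing quote), and the sketch also resolves relative paths with urllib.parse.urljoin, which the script above does not do.

```python
import re
from urllib.parse import urljoin

# Variant of the getImg pattern: the dot before "jpg" is escaped, and
# [^"] keeps a match from running past the closing quote of the attribute.
IMG_RE = re.compile(r'src="([^"]+?\.jpg)"')

def extract_jpg_urls(html, base_url):
    """Return an absolute URL for every src="...jpg" attribute in html."""
    return [urljoin(base_url, src) for src in IMG_RE.findall(html)]

# Hypothetical sample page, used only for illustration.
sample = (
    '<img src="http://example.com/a.jpg">'
    '<img src="logo.png">'
    '<img src="/img/b.jpg">'
)

urls = extract_jpg_urls(sample, "http://example.com/bbs/read.php")
print(urls)
```

Each resulting URL could then be passed to urlretrieve exactly as in the loop above. The .png entry is skipped because the pattern only accepts attributes ending in .jpg.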

0 0
