python爬虫实战（1）抓取网页图片自动保存

来源：互联网发布：mysql笛卡尔乘积编辑：程序博客网时间：2024/04/30 03:04

随便抓取个桌面吧的图片。网址如下：http://tieba.baidu.com/p/2970106602

找到源代码中的图片网址，由正则表达式可构建出规则：rule=r‘src="(.+?\.jpg)" pic_ext’

代码如下，简单明了

import reimport urllib.requesturl='http://tieba.baidu.com/p/2970106602'data=urllib.request.urlopen(url).read().decode()#读取并解码，默认应该是utf-8?rule=r'src="(.+?\.jpg)" pic_ext'compiled_rule=re.compile(rule)list1=re.findall(compiled_rule,data)x=1path='d://python//grab//photo'#构建本地保存路径for element in list1:    pathnew=path+'//'+str(x)+'.jpg'    urllib.request.urlretrieve(element,pathnew)    x=x+1

最后效果：

0 0

python爬虫实战（1）抓取网页图片自动保存
Python爬虫抓取网页图片
Python爬虫抓取网页图片
Python爬虫实战（1）——百度贴吧抓取帖子并保存内容和图片
python 爬虫入门1 网页图片保存
java爬虫实战简单用Jsoup框架进行网页爬虫（如抓取网页图片）
python网络爬虫，抓取网页图片
python网络爬虫（1）--抓取图片
爬虫抓取网页图片
爬虫抓取网页图片
java爬虫实战（1）：抓取信息门户网站中的图片及其他文件并保存至本地
python爬虫抓取图片
python网络爬虫系列(四) --- 批量抓取并保存图片
Python 爬虫抓取美女图片保存到本地
python爬虫之抓取网页中的图片到本地
Python爬虫学习笔记一：简单网页图片抓取
第一个python程序，小爬虫--抓取网页图片
Python网络爬虫（6）糗事百科图片抓取按主题名保存
popViewControllerAnimated 无效的问题解决
Mac install sublime3 and PackageControl plugin
程序编译与代码优化-晚期（运行期）优化
HTML中select标签年月日的设定
Tomcat服务器
python爬虫实战（1）抓取网页图片自动保存
Snacker 覆盖 FloatingActionButton 的问题
大部分程序员每天只有10-12行代码能进入最终软件产品
如何在前端开发中增加编码效率，这里有十款 Chrome 扩展可以帮你
1.1.3 计算机网络的功能
POJ 3276 Face The Right Way（开关转换）
CSRF verification failed. Request aborted.
CF－Educational Codeforces Round 15－A－Maximum Increase
常见的dos命令