Python3抓取页面图片

来源：互联网发布：c语言数据库编程编辑：程序博客网时间：2024/05/16 19:11

import urllib
import urllib.request
import re

def getHtml(url):
    page=urllib.request.urlopen(url)
    html=page.read()
    return html

def getImg(html):
    html=html.decode('utf-8')
    reg=r'src="(.+?\.jpg)"'
    imgre=re.compile(reg)
    imglist=re.findall(imgre,html)
    return imglist

html=getHtml("http://creativedreams.me/#modal-genius-hunt")

print(getImg(html))

因为python3中findall数据类型用bytes类型，因此应在正则表达式成使用类型转换。

0 0

Python3抓取页面图片
Python3 - 抓取静态页面（图片）
Python3抓取网页图片
python3抓取百度图片
Python3 抓取网页中的图片
Python3 抓取网页中的图片
Python3.4.4抓取网页图片
python3抓取糗百图片
python3抓取糗百图片
python3 抓取网页自有图片
Python3 爬虫--批量抓取图片
Python3抓取中文页面显示问题
简单的python3 urllib3 多线程抓取图片
Python3 网络爬虫之抓取图片
Python3简单爬虫抓取网页图片
抓取jsp页面生成图片
网络爬虫：抓取页面图片
python 爬虫抓取页面图片
消除忧虑的万能公式
point (出处不明)
深入了解当前ETL中用到的一些基本技术
UDP与TCP的区别
EM算法 The EM Algorithm
Python3抓取页面图片
Android 来电拦截的开发实现
根路径映射
mybatis分页
Android 睡眠流程
SQL统计1-12月的数据，没有数据的月份显示为0
Fedora20：1-硬盘
Sicily 2712. 继承与多态
iphone 文件操作