Python 爬虫

来源：互联网发布：java项目开发实例编辑：程序博客网时间：2024/06/04 17:41

图片获取

获取百度图片，例子：

"""获取网页图片"""class Demo:#获取网页信息    def getHtml(self,url):        page = urllib.urlopen(url)        html = page.read()        return html#匹配网页中的图片    def getImg(self,html):        #reg = r'src="(.*?\.jpg)" alt'        reg = r'"thumbURL":"(.*?\.jpg)"'        imgre = re.compile(reg)        imglist = re.findall(imgre,html)        x = 0        for imgurl in imglist:            urllib.urlretrieve(imgurl,'%s.jpg' % x)#保存到本地            x += 1spider = Demo()html = spider.getHtml("https://image.baidu.com/search/index?ct=201326592&z=&tn=baiduimage&ipn=r&word=%E5%A3%81%E7%BA%B8%20%E4%B8%8D%E5%90%8C%E9%A3%8E%E6%A0%BC%20%E7%BE%8E%E5%A5%B3&pn=0&istype=2&ie=utf-8&oe=utf-8&cl=2&lm=-1&st=-1&fr=&fmq=&ic=0&se=&sme=&width=&height=&face=0")print htmlspider.getImg(html)

阅读全文

0 0

python爬虫-->爬虫基础
[爬虫] Python爬虫技巧
Python爬虫
python 爬虫
python 爬虫
python 爬虫
python爬虫
Python爬虫
Python爬虫
python 爬虫
Python爬虫
python爬虫
python 爬虫
python 爬虫
python爬虫
python爬虫
python爬虫
python 爬虫
如何实现JS_MD5加密
plsql安装提示Warning: Some Oracle Net versions cannot...
sax解析过程图解
数组的slice()和splice()方法
python 实现差商
Python 爬虫
oracle控制文件-新增
java分支语句
Dragonboard 410c blueteeth-mic问题
spring mvc 攔截器跨域問題
Apache Spark 2.2.0 中文文档
oracle 11.2.0.4.0配置OEM
诗歌一我自倾杯，君且随意
centos6.5安装MySQL5.7及配置